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Size-biased branching population measures 
and the multi-type x log x condition 
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We investigate the a; log a; condition for a general (Crump-Mode-Jagers) multi-type branching 
process with a general type space by constructing a size-biased population measure that relates 
to the ordinary population measure via an intrinsic martingale Wt- Sufficiency of the x\ogx 
condition for a non-degenerate limit of Wt is proved and conditions for necessity are investigated. 
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1. Introduction 

The x log x condition is a fundamental concept for supercritical Galton- Watson branching 
processes, being the necessary and sufficient condition for the process to grow as its 
mean. In a Galton- Watson process with offspring mean m = E[X] > 1, let Z n denote 
the number of individuals in the nth generation and let W n = Z n /m n . Then, W n is a 
non- negative martingale and hence W n — > W for some random variable W . The Kesten- 
Stigum theorem is given as follows. 

Theorem 1.1. If E[Xlog + X] < oo, then E[W] = 1; if E[X\og + X] = oo, then W = 
a.s. 

Here, log + x = max(0, logs). It can further be shown that P(W = 0) must either be 
or equal the extinction probability, hence E[Xlog + X] < oo implies that W > exactly 
on the set of non-extinction (see, for example, Athreya and Ney (1972)). 

The analog for general single-type branching processes appears in Jagcrs and Ncrman 
(1984) and a partial result (establishing sufficiency) for general multi-type branching 
processes appears in Jagers (1989). Lyons, Pemantle and Peres (1995) give a slick proof of 
the Kesten-Stigum theorem based on comparisons between the Galton- Watson measure 
and another measure, the size-biased Galton-Watson measure, on the space of progeny 
trees. In Olofsson (1998), these ideas were further developed to analyze general single- 
type branching processes and the current paper considers general multi-type branching 
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processes with a general type space. In addition to providing a new proof of a known 
result, size-biased processes also provide tools to further analyze necessity of the a; log a; 
condition. 

A crucial concept for the Lyons-Pemantle-Peres (LPP) proof is that of size bias. If 
the offspring distribution is {po,Pi, ■ ■ • } and has mean m = E[X], then the size-biased 
offspring distribution is defined as 

kpk 

Pk = 

m 

for k = 0, 1, 2, . . . , where we note, in particular, that po = 0. A size-biased Galton-Watson 
tree is constructed in the following way. Let X be a random variable that has the size- 
biased offspring distribution and let the ancestor vq have a number Xq of children. Pick 
one of these at random, call her v\ , give her a number X\ of children and give her siblings 
ordinary Galton- Watson descendant trees. Pick one of v\S children at random, call her 
vi , give her a number Xi of children, give her sisters ordinary Galton- Watson descendant 
trees and so on and so forth. With P n denoting the ordinary Galton-Watson measure 
restricted to the n first generations, P n denoting the measure that arises from the above 
construction and W„ = Z n /m n , it can be shown that the relation 

dP n = W n dP n (1.1) 

holds. Hence, it is the martingale W n that size-biases the Galton-Watson process. The 
construction of P can also be viewed as describing a Galton-Watson process with immi- 
gration, where the immigrants are the siblings of the individuals on the path (vq, v\, . . .). 
Thus, the measure P is the ordinary Galton-Watson measure and the size-biased mea- 
sure P is the measure of a Galton-Watson process with immigration, where the i.i.d. 
immigration group sizes are distributed as X — 1. The relation between P and P on the 
space of family trees can now be explored using results for processes with immigration 
and this provides the final key to the proof. 

The general idea of using size-bias in branching processes appeared before LPP. One 
early example is Joffe and Waugh (1982), where size-biased Galton-Watson processes 
show up in the study of ancestral trees of randomly sampled individuals. This approach 
was further explored by Olofsson and Shaw (2002) with a view toward biological applica- 
tions. An approach similar to LPP appeared in Waymire and Williams (1996), developed 
simultaneously with, and independently of, LPP. Later applications and extensions of 
the powerful LPP method include Kurtz et al. (1997), Geiger (1999), Athreya (2000), 
Biggins and Kyprianou (2004) and Lambert (2007). 

To make this paper self-contained, we give a short review of general multi-type branch- 
ing processes and their a; log a; condition in the next section. As in the Galton-Watson 
case, branching processes with immigration are crucial in the proof; for that purpose, 
we briefly discuss processes with immigration in Section 3, following Olofsson (1996). 
The size-biased measure on the space of population trees and its relation to the ordinary 
branching measure is investigated in Section 4 and, in Section 5, sufficiency of the a; log a; 
condition is proved. Finally, in Section 6, we discuss various conditions for necessity. 
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2. The x\ogx condition for general branching 
processes 

In a general branching process, individuals are identified by descent. The ancestor is 
denoted by 0, the children of the ancestor by 1,2,... and so on, so that the individual 
x = (xi, . . . , x n ) is the x ra th child of the m„_xth child of. . . of the xith child of the ancestor. 
The set of all individuals can thus be described as 



At birth, each individual is assigned a type s, chosen from the type space 5 1 , equipped 
with some appropriate er-algcbra S. The type space can be quite general; usually, it is 
required to be a complete, separable metric (that is, Polish) space. The type s deter- 
mines a probability measure P«( - )> the life law, on the life space fi, equipped with some 
appropriate cr-algebra J- . The information provided by a life u> S Q may differ from one 
application to another, but it must at least give the reproduction process £ on S x R + . 
This process gives the sequence of birth times and types of the children of an individual. 
More precisely, let (r(fe), er(fc)) be random variables on fi denoting the birth time (age 
of the mother) and type of the fcth child, respectively, and define 



for A £ S and t > 0. We let r(fc) = oo if fewer than k children are born. The population 
space is defined as fl 1 , an outcome to 1 of which gives the lives of all individuals, together 
with the er-algebra JF 1 . The set of probability kernels {P s (-), s S S} defines a probability 
measure on (f2 7 , J-" 7 ), the population measure P s , where the ancestor's type is s. 

With each individual x E I, we associate its type o~ x , its birth time t x and its life u) x , 
where o~ x is inherited from the mother (a function of the mother's life) and lo x is chosen 
according to the probability distribution Po-^i') on (f2, J-). The birth time t x is defined 
recursively by letting the ancestor be born at time To = and, if x is the fcth child of its 
mother y, we let t x = r y + r(k). Note that t x and T y denote absolute time, whereas r(fc) 
is the mother's age at x's birth. 

An important entity is the reproduction kernel, defined by 



the expectation of £(dr x dt) when the mother is of type s. This kernel plays the role 
of m = E[X] in the simple Galton- Watson process and determines the growth rate of 
the process as e Qt , where a is called the Malthusian parameter. We assume throughout 
that the process is supercritical, meaning that a > 0. The existence of such an a is not 
automatic; sufficient conditions may be found in Jagers (1989, 1992). For the rest of 
this section, we leave out further technical details and assumptions, instead focusing on 
the main definitions and results. The details can be found in Jagers (1989, 1992) and 



oo 




C(4x [0,t]) = #{fc:cr(fc) eA,r(k) <t} 



/j,{s, dr x dt) = E s [£(dr x dt) 
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we simply refer to a process that satisfies all of the conditions needed as a Malthusian 
process. 

Given [i and a, we define the kernel as 



ju(s,dr)= / e _Q >(s,dr x di) (2.1) 
Jo 

which, under certain conditions, has eigenmeasure 7r and eigcnfunction h given by 

7r(dr) = / //(s, dr)7r(ds), 

^ (2.2) 
ft(s) = / h(r)fi(s,dr), 



JS 

where both 7r and hdir can be normed to probability measures. The measure 7r is called 
the stable type distribution and h(s) is called the reproductive value of an individual of 
type s. The interpretation of w and is that tt is the limiting distribution of the type of an 
individual chosen at random from a population and h(s) is a measure of how reproductive 
the type s tends to be, in a certain average sense. Moreover, after suitable norming, it 
can be shown that h dir is the probability measure that is the limiting type distribution 
backward in the family tree from the randomly sampled individual mentioned above. The 
mean asymptotic age of a random child-bearing in this backward sense is denoted by 
and satisfies 

0=1 te- at h(r)fi(s,drxdt)Tr(ds)<oo. (2.3) 

JSxSxR + 

To count, or measure, the population, random characteristics are used. A random char- 
acteristic is a real- valued process x, where x( a ) gives the contribution to the population 
of an individual of age a. Thus, x is a process defined on the life space and by letting \x 
be the characteristic pertaining to the individual x, the x-counted population is defined 

as 

xei 

which is the sum of the contributions of all individuals at time t (when the individual x 
is of age t — r x ). The simplest example of a random characteristic is x( a ) = Ir+ ( a )? th. c 
indicator for being born, in which case is simply the total number of individuals born 
up to time t. 

To capture the asymptotics of Zf, the crucial entity is the intrinsic martingale Wt, 
introduced by Ncrman (1981) for single- type processes and generalized to multi-type 
processes in Jagers (1989). For its definition, denote a;'s mother by mx and let 

X t = {x:T mx <t <t x }, (2.4) 

the set of individuals whose mothers are born at, or before, time t, but who themselves 
are not yet born at time t. The set X t , sometimes referred to as the "coming generation", 
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generalizes the concept of generation in the Galton- Watson process. Now, let 

W t = j^—- c~ aTv h(cr x ), (2.5) 

the individuals in T t summed with time- and type- dependent weights, normed by the 
reproductive value of the ancestor. It can be shown that Wt is a martingale with respect to 
the c-algebra Tt generated by the lives of all individuals born before t and that E s [Wt] = 1 
for all s £ S. Hence, W t plays the role that W n = Z n /m n does in the Galton-Watson 
process and the limit of Zf turns out to involve the martingale limit W = lim^oo Wt ■ 
The main convergence result is of the form 

ap 

P s -almost surely for 7r-almost all s € S as t — > oo . Here, cto = s is the type of the ancestor, 
E v [-] = J s E s [-]n(ds) and x( a ) is the Laplace transform of x( a ) evaluated at the point 
a. As in the Galton-Watson case, the question is when the martingale limit W is non- 
degenerate. As W t -^W P s -a.s. and -EslW 7 *] = 1, L 1 -convergence with respect to P s is 
equivalent to -E S [W] = 1 (Durrctt (2005), page 258). Note that although it is the process 
Z* that is of interest and not Wt itself, the asymptotics are determined by Wt, one of 
many examples in probability of the usefulness of finding an embedded martingale. 

We are ready to formulate the general a; log a; condition and the main convergence 
result. For the reproduction process £, define the transform 

i = I e- at h(r)Z(dr x dt) (2.6) 

JSxR + 

which plays the role of X in the Galton-Watson process (in fact, in that case, £ = X/m). 
For future reference, let us also state an alternative representation of £. Denote the 
sequence of birth times and types in the process £ by r(l), tr(l), r(2), ct(2), . . . and so on. 
Then, 

OO 

£ = £e-°"-«A(*(i)). (2.7) 

i=l 

The a; log a; condition and convergence result are given in the following theorem from 
Jagers (1989). 

Theorem 2.1. Consider a general multi-type supercritical Malthusian branching process 
with 

£U£log + £]<oc. 
Then, E s [W] = 1 for tt -almost all s, from which it follows that 

ap 
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in L l (P s ) for ir-almost all s. 

3. Processes with immigration 

As mentioned in the Introduction, branching processes with immigration are crucial 
to our proof and in this section, we state the main result for such processes. Consider a 
general branching process where new individuals immigrate into the population according 

to some point process n{dr x dt) with points of occurrence and types (ri, <7i), (t2, 02), 

The fcth immigrant initiates a branching process according to the population measure 
P ak . The immigration process has the transform 



and it can be shown that the process Wt is now a submartingale rather than a martingale 
(which is intuitively clear because offspring of immigrants may be added to the set 
It). The limit of Wt is therefore not automatically finite, but needs a condition on the 
immigration process, established by the following lemma from Olofsson (1996). 

Lemma 3.1. Iffj<oo a.s., thenWt^W a.s. asi— >oo, where W < oo a.s. 

4. The size-biased population measure 

Recall that the LPP size-biased Galton- Watson measure was constructed from the size- 
biased offspring distribution. General branching processes require a more general concept 
of size-bias. In a general process, the offspring random variable X is replaced by the 
reproduction process £, the size of which is properly measured by the transform £ which 
leads to the following definition. 

Definition J^.l. The size-biased life law P s is defined as 



From Jagers (1992), we know that the eigenfunction h is finite and strictly positive, 
so P is well defined. The following lemma follows immediately from the definition of P s . 

Lemma 4.2. Let P s and P s be as above and denote the set of realizations of reproduction 
processes by T, equipped with a a-algebra Q. Then, 




P s (doj) 



CM 



P s (doj). 



h(s) 



(i) for A 6 T , 



Ps{A) 



E s [£ ; A] 
h(s) 
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(ii) for every Q -measurable function g.T —> R, 



h(s) 



Note that P s is indeed a probability measure for all s £ S because 

h(s) 

where E s [£\ = h(s) follows from the definition of £ in (2.6), together with (2.1) and (2.2). 
Also, note that a size-biased reproduction process always contains points because 

PMS x R+) = 0) = -L.E a [Z;Z(S x R+) = 0] = 0, 
h(s) 

in analogy with the size-biased offspring distribution in a Galton- Watson process (£(S x 
i?+ ) is the total number of offspring of an individual) . 

To construct the size-biased population measure, let P* and P* denote the restrictions 
of the measures P s and P s to the tr-algebra Tt ■ The goal is to construct a measure P s 
on (O 7 , J-" 7 ) that is such that 

P t s {Au I ) = W t {u I )P t s {^ 1 ) 

for all t, where Wt is the intrinsic martingale defined in (2.5). This measure is the direct 
extension of the size-biased measures from Lyons et ol. (1995) and Olofsson (1998). The 
construction also involves the set T t , defined in (2.4), whose individuals all have mothers 
that are born up to time t. Thus, the type and birth time of an individual in Tt is 
measurable with respect to Tt , which implies that Wt is also measurable with respect to 

The construction of the size-biased population measure extends the construction in 
Olofsson (1998) as follows. Start with the ancestor, now called Vo, and choose her life wo 
according to the size-biased distribution P s (du;o) = ^jP s (do;o). Pick one of her children, 
born in the reproduction process £oi such that the ith child is chosen with probability 



. Call this child Vi, let her start a population according to the size-biased 

population law P Gi and give her sisters independent descendant trees such that sister j 
initiates a branching process according to the regular population law P (Tj . Continue in 

this way and define the measure P s to be the joint distribution of the random tree and 
the random path (vq,V\, . . .). We shall borrow a term from Athreya (2000) and refer to 
the path (yo, v\, . . .) of chosen individuals as the spine. 

Now, fix an individual x in the set T t defined in (2.4) and consider the probability P s , 
constrained by the individual x being chosen to be in the spine. Specifically, if S(x,t) 
denotes the event that the individual x in Tt is chosen to be in the spine and A 6 J- 1 , 
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then we consider the measure P s (-;x) defined by 

P t s (A;x)=P t s (AnS(x,t)). (4.1) 

Denote by i the individual in the first generation from whom x stems, that is, x = (i,y) 
for some y. Hence, if x is in the nth generation, then it is of the form x = (x±, x%, . . . , x n ), 
where x\ = i, and we let y = {x2, ■ ■ ■ , x n ). In words, y is the same individual as x when i 
is viewed as the ancestor. Let cjV' denote the lives of all individuals when j is viewed as 
the ancestor to obtain 

Hs) Co 11 

where the first factor describes the size-biased choice of life of the ancestor. The second 
factor is the probability that the individual i in the first generation is chosen to be in the 
spine and the third factor describes the size-biased probability measure of the process 
starting from i, constrained by the individual y being in the spine. Finally, the fourth 
factor describes the regular population measures stemming from the individuals in the 
first generation who are not chosen to be in the spine. 

The following proposition states the desired relation between the size-biased measure 
Pi and the regular population measure P*. 

Proposition 4.3. Let Pi and Pi be the restrictions of P s and P s to the a-algebra Tt 
and let Wt be as in (2.5). Then, 

dP* 

— = W t . 

dPl 



Proof. Let P s {-;x) be as in (4.1). Then, 

P*(d^;x) = ^^)p*(d^). 



Further, note that for the regular population measure, we have 

pKdu 1 ) = p s (du Q ) n K; Tj (d^ j) ) 

0=1 

=p s (d W0 )p^ ri (d^ (4) )n< Tj ( dwb) ) 

= e p 5 (dc ) h f ) J p^ (d« w ) n p*p (d « w ) . 
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Now, let T y (i) and cr y (i) denote the birth time and type of the individual y when i is 
viewed as the ancestor. We then have t x = Tj + T y (i) and a x = u y (i). Multiply P s '(du/) 

by - — hil) 17 ^ t° obtain 
h[s) 



h{s) £ h{o- t ) ax 



T ^(dw (j) ) 



h(s) £ /i(o-j) 

= T^P s (dc ) • ^J^il .^(d^W;,,) -n^Cd^) 

=pi(d W / ;a; ), 

by (4.2). Finally, sum over x €lt to get 

where uj 1 is suppressed, but understood as the argument of t x , a x and Wt ■ □ 

To summarize, restricted to the a- algebra J 7 *, the size-biased population measure P s 
relates to the regular population measure P s via the Radon-Nikodym derivative W t = 

dP* 

■Tpt, a relation that is a straightforward extension from Olofsson (1998), which, in turn, 
is the straightforward extension of the original LPP method. This result is also similar 
in nature to Proposition 1 in Athreya (2000), which deals with a different martingale. 

The individuals Vq,Vi,... in the spine are of particular interest in our analysis. From 
now on, we will use <7o,<7i, . . . to denote the types of the individuals in the spine. The 
inter-arrival times are denoted Ti,T2, . . . that is, T k is the time between the appearances 
of the (k — l)th and kth individual. In the single-type case treated in Olofsson (1998), the 
individuals in the spine have lives that arc i.i.d., but the situation is now much different, 
with dependence between consecutive individuals introduced via types. As we shall see 
later, this dependence is also what prevents the proof of necessity of the a; log a; condition 
from carrying over from the single-type case. 

We now state important properties of the sequences of types and inter-arrival times 
in two lemmas. The first lemma deals solely with the type sequence. For the rest of 
this section, we use the notation P rather than P s since the conditional probabilities we 
consider do not depend on the initial type. 

Lemma 4.4. The sequence of types (<7o,<7i, . . .) in the spine is a homogeneous Markov 
chain with transition probabilities 

h(r)^ 

P(a k+1 e dr\a k = s) = ——^,(s, dr) 
h(s) 



1296 

and stationary distribution u(ds) = h(s)ir(ds). 



P. Olofsson 



Proof. In a generic reproduction process £, denote the birth time and type of the ith 
offspring by r(i) and o~(i), respectively. Note the difference between a(i) and o~i, the latter 
being the type of the ith individual in the spine. The transition probabilities satisfy 



5<r{i) (dr) 



P(a k +i £ dr\a k = s) 

= ^-P(<Tfe+i £ dr,w fe+ i =i[<Tfe = s) =y^-E s 

£ i 
1 1 f 00 

^^S s [e-^ i )/ l (a(i))<5 CTW (dr)] = ^ ^ E a [e-°*h(r)t(dr x di)] 
^(s,dr), 



Mr). 



where we have used Lemma 4.2 applied to the function 

e- QT *M^) 



9 



(o=E : 



Next, let i/(ds) = ft,(s)7r(ds). As 



we get 



/2(s, dr)7r(ds) = 7r(dr), 
-/x(s, dr)^(ds) = v(dr) 



IseS h ( s ) 

and thus the Markov chain of types in the spine has stationary distribution v = hdir. □ 

The second lemma deals with the sequence of types and inter-arrival times of the 
individuals in the spine. 

Lemma 4.5. The sequence of types and inter-arrival times (o~o,Ti,o~x,T2, . . .) of the 
individuals in the spine constitutes a Markov renewal process with transition kernel 



P{T k+ i G dt,a k +i £ dr\a k = s) 



^4e" Q V(s,dr x dt) 
h{s) 



and the expected value of T k when erg ~ v is 

E 1/ [T k ]=/3<oc, 

where /? was defined in (2.3). 
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Proof. Similarly to the proof of Lemma 4.4, we get 
P(T k+ i G dt,a k+1 G dr\a k = s) 

= 2J P( T k+i e dt, (Tfc+i G dr, v k+ i = i\a k = s) 

i 

1 -E s [c- at h(r)£{dr x di)] 



h( s y 

-e~ Q V(s,dr x dt) 



MO -a. 



h(s) 

and the expected value of T k when <7o is chosen according to the stationary distribution 
v = h dir is 

E v [T k ] = [ t^f\e- at Li(sAr x dt)h(s)ir(ds) 



te- at h{r)ii(s, dr x dt)Tr(ds) = /3, 

I Sx [O.oo) 

by (2.3). □ 

There is an interesting connection between the size-biased measure and the stable pop- 
ulation measure from Jagers (1992). The latter is an asymptotic probability measure that 
is centered around a randomly sampled individual as t — > oo. In such a stable popula- 
tion, the randomly sampled individual is born in a point process that has the size-biased 
distribution, the asymptotic type distribution as time goes backward through the indi- 
vidual's line of descent is hdir and the asymptotic mean age at a random child-bearing 
is /?. The transition probabilities in this backward chain also involve /I(s,dr), but have 
weights that are expressed in terms of tt rather than h, as we have in the size-biased 
measure where time goes forward. This relation becomes clearer in a finite-type Galton- 
Watson process where ir and h are simply the left and right eigenvectors of the mean 
reproduction matrix. 



5. Sufficiency of the x\ogx condition 

We will soon be ready to prove the general a; log a; theorem, Theorem 2.1, the key to 
which is the relation between P s and P s . Recall that the two are related through Wt, 
which is a martingale under P s and a submartingalc under P s . The following lemma 
relates the limiting behavior of Wt under Ptt(-) = j s P s (-)7r(ds) to its limiting behavior 
under P v {-) = f s P s (-)h(s)n:(ds). 

Lemma 5.1. Let W = limsup t Wt. Then, 
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(i) P V (W = 00) = 0=>E*[W] = 1; 

(ii) P V (W = 00) = 1 => E w [W) = 0. 

Proof. By Durrett (2005), page 239, 

P V (A) = E V [W; A] + P U {A C){W = 00}). 
If P V (W = 00) = 0, then we have P V {A) = E V [W; A] and get 

P v (W = 0) = E v [W ; W = 0] = 0. 

Hence, 

E v [W] = E v [W : W > 0] = P u (W > 0) = 1. 

Moreover, as 

1 = E V [W] = ( E s [W]u(ds) 
Js 

and as Fatou's lemma implies that -E S [W] < 1 for all s, we must have £?s[W] = 1 for v 
almost all s G S. As it <C v, we also get £V[W] = 1. 

Next, suppose that P V (W = 00) = 1. As W is an a.s. finite martingale limit under P s , 
for 7r-almost all s € S, we have 

P V {W = 00)= / P S (W = oo)h{s)n{ds) =0. 
Js 

Hence, the measures P v (-) and E V \W\ ■] are mutually singular and, as P„(W > 0) > 0, 
we get 

E V [W] = E„[W;W >0] = 0, 
which implies that E v [W] = as well. The proof of the lemma is thus complete. □ 

Another connection between expected values under P w and P Vl directly involving the 
a; logs condition, is given by the following corollary to Lemma 4.2. 

Corollary 5.2. 

KK"iog + £H^[iog + e 

Proof. Choose = log + f in Lemma 4.2 to obtain 

Js 

= f h(s)E s [\og+ ^Tr(ds) 
Js 
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The logical structure of the proof of Theorem 2.1 is as follows. Assume that 
E-r [£log + £] < 00 ■ By Corollary 5.2, we then have E„[\og + < 00. If we can show that 
this, in turn, implies that W < 00 almost surely with respect to P v , then we can invoke 
Lemma 5.1 (i) to conclude that ^-[W] = 1, after which the proof is more or less complete. 
The gap is filled by the next lemma, which utilizes results for general branching processes 
with immigration. It has been mentioned that such processes play a vital role in the LPP 
method and, in the current setting, we observe that the individuals off the spine con- 
stitute a general branching process with immigration, the immigrants being the siblings 
of the individuals v\, V2, ■ ■ ■ in the spine in an obvious extension of the Galton- Watson 
case. To describe the immigration process formally, let Ij t k be the indicator of the event 
that Vj-i's fcth child is not chosen to become Vj, denote the immigration time of the jth 
immigrant by Tj and denote the birth time and type of the fcth individual in by T k {j) 
and crfe(j), respectively. The immigration process 7/ is 

r)(ds x dt) = ^2S (rh (j)(ds)8 Th (j)(dt-T j )I j ,k, 

which has 

fj = J2 h(a k (j))e- a ^e- a ^I jtk . (5.1) 

We are now ready for the last lemma needed for the proof of Theorem 2.1. 

Lemma 5.3. Consider a general branching process with immigration process rj as above. 
If -E„[log + £] < 00, then W = limt-yoo Wt exists and is finite P^-a.s. 

Proof. First, note that 

^[log + £] < 00 =S> V" P„(log + £ > era) < 00 for all c > 0. 

n 

Now, consider the sequence £o 5 £i) • • ■ f° r the individuals in the spine. Recalling that v is 
the stationary distribution of the Markov chain of types in the spine, we conclude that 
P„(log + £~ > era) = /^(log^ £~ n > era) for all ra, which gives 

y~]P„(log + £n > era) < 00. 

n 

By the first Borel-Cantelli lemma, we get 

P,(log + <e„ >cra i.o.) = 0, (5.2) 
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which, by (5.1), gives 

oo 

= V" c~ aT ^j < oo P u -a.s. 

since the Tj are sums of the Tk , which, being the regeneration times in a Markov renewal 
process, obey the strong law of large numbers (Alsmeyer (1994)) so that Tj ~ j3j almost 
surely as j — > oo (recall that E v \Tk] = (3 < oo). The last sum above is a.s. finite because 
of the subexponential growth of the established in (5.2). By Lemma 3.1, we conclude 
that lim t Wt exists and is finite P„-a.s. □ 

We now have in place all of the preliminaries needed to prove Theorem 2.1. 

Proof of Theorem 2.1. As .E^ [£log + £] = E v \[og + £] < oo, Lemma 5.3(i) implies that 
P V (W = oo) = and Lemma 5.1(h) gives £ ff [W] = 1. Moreover, as 

E„[W} = [ E s [W]ir(ds) 
Js 

and as Fatou's lemma implies that i? s [W] < 1 for all s, we must have -E S [W] = 1 for 
7r-almost all sgS. The proof is thus complete. □ 

6. Necessity of the x\ogx condition 

For single- type processes, the condition of having a finite x log x moment is both sufficient 
and necessary. Using the size-bias method, this can be established by using the first 
and second Borcl-Cantclli lemmas, respectively. However, in the multi-type setting, the 
main result in Jagcrs (1989) establishes only sufficiency and it is currently not known 
whether necessity holds. The method of size-biased branching processes provides a way 
of investigating necessity and although the second Borel-Cantelli lemma cannot be used 
due to dependence, more general versions can be employed. This section is exploratory 
in nature and does not provide any definite solutions, but rather outlines two different 
approaches to establish necessity under additional conditions through the conditional 
Borcl-Cantclli lemma and the Kochen-Stone lemma, respectively. 

By "the xlogx condition", we mean the condition that log + £] < oo. Hence, to 
establish necessity, we need to assume that E^[^ log + £] = oo and show that this assump- 
tion implies that -E^VF] = 0, invoking Lemma 5.1(h) in an intermediate step. The logical 
structure parallels that of the proof of sufficiency: if £V[£log + £] = oo, then Lemma 5.2 
yields that i?„[log + £] = oo, which implies that 

^P„(log + ^>cn) = oo. 

n 
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Since v is the stationary distribution for the types in the spine, this further implies that 

^2 -P^(log + In > cn) = co. 

n 

In the proof of sufficiency, this sum was finite and the first Borel-Cantelli lemma could 
be invoked to conclude that P„(log + £ n > cn i.o.) = 0, leading to the rest of the proof. 
However, as the reproduction process £ n +i is chosen according to a probability distribu- 
tion that depends on the type o~ n which is determined by the parent reproduction process 
£„, we cannot assume that the are independent. Hence, we cannot invoke the second 
Borel-Cantelli lemma to conclude that, almost surely, log + £„ > cn i.o. This constitutes 
a difference from the single- type case treated in Olofsson (1998), where sufficiency and 
necessity were established using the first and second Borel-Cantelli lemmas, respectively. 
Below, we establish necessity of the a; log a; condition under various additional conditions. 

The conditional Borel-Cantelli lemma states that if {J- n } is a filtration and {A n } a 
sequence of events with A n <G T n , then 

{A n i.o.}= |^F(A n |J"„_i) = oo|; 

see Durrett (2005). For us, A n = {log + £„ > cn}, the er-algebra T n -\ gives the type a n 
of the nth individual in the spine and we get 

P s (log+ l n > cn\ Tn-i) = ?a n (log + £ > cn). 

The question then becomes under which conditions 

oo 

P an (log + £ > cn) = oo P^-a.s. 

n=l 

given that 

oo 

^P„(log + e>cn) = oo. 

One more step is necessary in order to invoke part (ii) of Lemma 5.1, namely, to argue 
that log + f n > cn i.o. implies that W = limsup t Wt = co P„-a.s. We state this as a lemma. 

Lemma 6.1. // P v (log + £ n > cn i.o.) = l, then P V (W = oo) = 1. 

Proof. Consider W Tn , the value of Wt at the time of the arrival of the nth immigrant 
v„ . All of the children of this immigrant except the one chosen to become w„+i belong 
to X Tn , so these children form a subset of The fcth child is born at time r„ + r(fc) 
and has type a(k), where r(k) and cr(k) are the points in the reproduction process of 
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v n . Let Ik be the indicator of the event that the fcth child of v n is not chosen to become 
v„+i to obtain 

_ r oo 

where C = sup s h(s) < oo (Jagers (1989)) and we recall (2.7). Adjust the last part of the 
proof of Lemma 5.3 to conclude that W = limsup t Wt = oo P„-a.s. □ 

Our first result establishes necessity of the xlogx condition under the additional as- 
sumption that there is a type that is revisited infinitely often in the spine. 

Proposition 6.2. // the Markov chain of types o~q,o~i, . . . in the spine has one positive 
recurrent state r such that E r {£ log + £] = oo and if, for it -almost all starting types s, there 
exists k such that o~k=r a.s., then the xlogx condition is necessary. 

Note. The Markov chain is on a general state space, but here we use "positive recur- 
rence" in its elementary meaning, namely, that E r [T r ] < oo, where T r = inf{n >0:a n = 
r}. 

Proof of Proposition 6.2. The result follows from the following observation regarding 
infinite series. Let a n > be a decreasing sequence of real numbers such that a n = oo, 
let X\, X2, ... be i.i.d. non- negative and integer- valued random variables with finite mean 
[i and let T n = X x + X 2 H h X n . Then, 

00 

ax n = 00 a.s. 

n=l 

This holds because if k > fj, is an integer, then ax n > a n k a.s. for large n, by the strong 
law of large numbers, and, obviously, ^2 n a n k = 00 for all fixed k. Finally, apply this 
result to a n = P s (log + <^ > cn) with X\,X2, ■ ■ ■ being the consecutive inter-return times 
to the state s in the Markov chain of types in the spine (that is, the X^ are i.i.d. copies 
of T r above). □ 

One instance where the assumption of Proposition 6.2 follows from the xlogx condition 
is if the type space is finite. Indeed, if there are n types, we have 



00 = £.[eiog+ = J2 £U£"log + £Mr 



fc=l 
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which implies that we must have E r [£ log + £] = oo for at least one r. In particular, this 
establishes necessity of the x log x condition for the ordinary multi-type Galton- Watson 
process with a finite type space (under the usual assumptions of positive regularity and 
non-singularity; see Athreya and Ney (1972)). 

Another approach is to consider the rate of convergence of the type chain ao,a\,. . . 
toward its stationary distribution v. To simplify the analysis, let Y — [log + £], the integer 
part of log + £, which has finite mean under P v if and only if log + £ does. We then have 
the following. 



Proposition 6.3. Suppose that 



lj2P° k {Y = n)-PAY = n) 



k=l 



< OO. 



The xlogx condition is then necessary. 
Proof. The condition in the proposition implies that 
1 ™ ~ 

71 * 



n>l 



k=l 



< oo Py-a.s., 



k=l 



which yields 

oo oo 

j2p <7k (Y>k)=j2Y, p °>*( Y = n ) 

k—l n>k 
n>l \ k=l 

> y~]nP v (Y = n) - E' 



n>l 
OO 



1 " ~ 

-J2P ak (Y = n)-P u (Y: 
a * — » 



k=l 



since the first term equals E^y], which is infinite if E„[log + £] is infinite, this being the 
case if the xlogx condition does not hold. The second term is finite by assumption. □ 



Note that if {a n } is Harris recurrent (Alsmeyer (1994), Durrctt (2005)) with stationary 
distribution v, then the ergodic theorem yields 

1 n ~ 

-E p ^( y =-?)^ p "( r =-?) p -- a - s - 

n ' — ' 

k=l 

for all j, so our condition means that this convergence is, in some vague sense, "fast 
enough" . 
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Another generalization of Borel-Cantelli is the Kochen-Stone lemma that states that 
if E n P ( A n) = °°, then 

P(A n i.o.) > limsup : { ^ =1 P{Ak)}2 



We can apply this to prove the following result. 

Proposition 6.4. Let A n = {log + £„ > cn}. If the (indicators of the) A n are pairwise 
negatively correlated, then the a; log a; condition is necessary. 

Proof. Because P v (Aj n4)< P v (Aj)P v (Ak) , we get 

Pu(A n i.o.) > limsup E '^ 1 ^ (Afc)}2 > 1. 

» Ei< j>k <n PAAjnAk) □ 
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